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Abstract 


Current SSME high pressure fuel turbopump problems have generated a 
desire to analyze the flowfield of rotating machinery. The status of CFD 
has reached the point that soon the capability to solve unsteady three- 
dimensional viscous flowfields will be at hand. The work presented here 
involves upgrading the computational efficiency of an operational three- 
dimensional algorithm. The modifications include algorithm development, 
algorithm approximation and acceleration, and special coding optimiza- 
tions. The overall result of these modifications has reduced processing 
time by nearly 80$. 


Introduction 


Presently, there exists a need for a robust and accurate computa-- 
tional algorithm to compute three-dimensional flow over a rotor-stator 
configuration. An internal flowfield analysis of rotating machinery 
necessitates the use of an advanced time-accurate three-dimensional 
algorithm with dynamic grid capability. Additionally, this algorithm 
must be computationally efficient, i.e., be capable of producing solu- 
tions of engineering accuracy to real world flows in a reasonable length 
of time. Until recently, the CFD community didn't even consider routine 
calculations of a typical three-dimensional unsteady viscous flow. This 
was primarily due to the large number of cells required to resolve the 
pertinent flowfield phenomena. The domain must be subdivided into sev- 
eral million cells, requiring approximately 30 computer words each, 1 the 
result of which is excessive computational processing times using pres- 
ent day codes. The introduction of the Cray 2 supercomputer and the 
availability of Cray's SSD storage have alleviated much of the concern 
regarding storage for those with access to this state-of-the-art equip- 
ment. 

The initial step toward developing a code which can fulfill the 
requirements stated above begins with the choice of a baseline 
algorithm. The baseline algorithm chosen for this endeavor is that de- 
scribed in Ref. 2. This algorithm has been shown to yield satisfacto- 
rily resolved solutions of the flow past a variety of geometries in many 
flow regimes. The baseline computer algorithm, herein referred to as 
BMULE, is lacking in the computational speed required to accomplish the 
task previously mentioned. The storage problem is of a lesser concern 
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in light of the recent advances in computer hardware introduced by Cray 
Research. An estimate at the onset of this project placed the memory 
requirement at approximately 100,000,000 words and CRAY X-MP CPU time 
at approximately 400 hours for a typical three-'dimensional rotor-stator 
configuration flow analysis. It is evident that improvements in the 
baseline algorithm are in order, before any three-dimensional viscous 
rotating machinery flowfield solutions will be feasible. 

This first year effort lays the ground work for an algorithm, the 
purpose of which will be to perform the tasks outlined in the first 
paragraph of this section. The intention here is to improve an opera- 
tional computer algorithm, keeping in mind the basic goal of approaching 
the computational efficiency required if one ever expects to produce 
routine flow analysis of rotating machinery. 

The material presented in this report entails three basic 
approaches to enhance computational efficiency. These are: 1) algorithm 
improvement, 2) acceptable algorithm approximations, and 3) coding modi- 
fications. The first section of this report briefly describes the equa- 
tions and the baseline algorithm. This is followed by the note-worthy 
accomplishments in each of the specified approaches, given in the second 
through the fourth sections. The results of this first year effort are 
then presented in the fifth section. Finally, the sixth section presents 
the conclusions of this report based on the year's findings. Also, pre- 
sented are suggestions for future development in the field of rotating 
machinery flow analysis. 
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Equations & Algorithm 


The present work is based on the equation set developed in Ref. 3 
and the algorithm developed in Ref. 2. Consider the fully discretized 
linearized integral form of the flux split fluid dynamic equations writ- 
ten as 


[I + At(< 5.A* + 5.A* + <5 jB* + 6 jB~ + 6 k C* + 6 k C*)]AQ n = -AtR n (1) 


where 


AQ n = Q n+1 - Q n 


(2) 


and the flux jacobians 


. + 1 9F* ^n 

A • l sr) 


(3) 


rt“ f 3F >n 

# ' (ar 1 


B* . ($§-)" 


with 


R“ - U'jK + 6^ 


6jG + + 6 jG + o k H + + ^ 


for inviscid flow analysis, or 

R n = (6 i F + + 5 jF" + 6 jG + + 6jG~ + 6 k H + + 6 k H~ + S n ) 


/ li \ 
V t J 


(5) 


where 



for viscous flow analysis. For a detailed review of F + , F~, G + ,...etc., 
the reader is referred to Refs. 2 and 3. F y , G y , and H y (not mentioned 
in Refs. 2 and 3) represent the effects of the viscous stresses acting 
on a cell. They are implemented in an explicit fashion and have been 
shown not to markedly affect the speed of the baseline algorithm. 

In the interest of limited computational funds, it was deemed im- 
practical to operate with the viscous stresses activated when running 
the test cases presented herein. This is solely due to an inabililty to 
adequately resolve a flowfield when restricted to coarse mesh funding. 
It should be noted that the additional computations per cell introduced 
hy the viscous stresses are minimal. 
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Algorithm Improvement 


The matrix structure of Eq. (1 ) tends to be cumbersome and difficult 
to invert, not to mention very costly. Presently, two well-known methods 
are used to approximate Eq. (1), approximate factorization (AF) and re- 
laxation methods. As in Ref. 2, the AF approach is preferred to a 
relaxation method. 

There exist several methods for approximately factoring Eq. (1). 
Some retain the full implicit nature of the system, yet they require 
relatively sophisticated solution algorithms. Others possess only a 
point implicit nature and lend themselves to much simpler solution 
algorithms. These methods are inevitably multi-step, upper or lower 
block triangular in matrix structure with the attractive feature of re- 
quiring point simultaneous solutions and backward or forward substitu- 
tion. 

Two of the attractive factoring schemes were scrutinized during the 
course of this project. The first of these is that which the baseline 
algorithm utilized as a solution procedure. It will be referred to here 
as the six factor(6F) scheme. This scheme involves the factorization of 
Eq. (1 ) based on the sign of each spatial direction in computational 
space yielding the following 

(I+AT6 i A-)(I+AT6 i A*)(I+AT6jB-)(I+AT6jB 7 )(I+AT5 k C-)(I+AT6 k c" )AQ n =-AtR n (7) 
The solution of Eq. (7) can be carried out as follows: 

(I + AtS^-JX 1 = -At R n (8a) 

(I + At6 i A~)X 2 = X 1 (8b) 
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(I + Ax6jB*)X 3 = X 2 (8c) 

(I + At6jB")X 4 = X 3 (8d) 

(I + Ax6 k C*)X 5 = X 4 (8e) 

(I + Ax6 k c")X 6 = X 5 ( 8f ) 

AQ n = X 6 ( 8g) 

An alternate factorization is based on the like sign of all three 
spatial directions in computational space yielding the following 

[I + Ax(<5^A* + 6jB* + 6 k C*)][I + Ax(6^A* + 6jB* + 6 k C*)]AQ n = -AtR 11 

This factoring scheme is referred to as the two factor (2F) scheme, the 
solution of which can be carried out as follows 

[I + Ax^A* + 6 jB* + 6 r (> ) ]X^ = ~AxR n (9a) 

[I + Ax(6 i A* + 6 jB* + 6 k C r )]X 2 = X 1 (9b) 

AQ n = X 2 (9c) 

The error introduced via the 6F scheme has the form 

{ At 2 5 jA • 6 ^A* ( I + a) ( T + ft) + rAx(<S*A* + 6 jA*)]CB + a (I + B)3 + 

AT 2 6 j B-6 j B*(I + B) + C At (6 jB* + 6jB*)]B + Ax 2 6 k C *6 k C • }AQ n (10) 

where 

a « [At (6 jB* + 6 jB- ) + At 2 6 jB* 6 jB* ] (11a) 

6 = [Ax(6 k C* + 5 k c") + Ax 2 6 k C*6 k c“] (11b) 
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An examination of the terms indicates the error to be of higher order of 
accuracy than that of Eq.(1), consequently the overall order of the 
scheme is maintained. The error introduced via the 2F scheme has the 
form 

CAt(5 i A* + 6 jB* + 6 k C- ) ][Ax (6 iA^ + 5jB- + 5 k C~)]AQ n (12) 

Inspection of this error also indicates it to be of higher order of 
acccuracy than Eq.(1), again the overall order of the scheme is maim 
tained. 

Judging from Eq. (10) and Eq. (12), it seems plausible to expect 
the 2F scheme to be somewhat superior to the 6F scheme. Although Eq. (1) 
represents an unconditionally stable scheme, due to the error introduced 
by the factoring process a restriction may be imposed on the time-step 
to maintain stability. A stability analysis of both schemes was pres- 
ented by Anderson, et al., 1 * the results of which are given in Fig. 1. 
For the Courant numbers sampled, the figure indicates the 6F scheme to 
be more restrictive than the 2F scheme regarding maximum allowable 
time«step. It is encouraging to note that the 2F scheme seems to retain 
the favorable quality of unconditional stability. 

There are advantages and disadvantages to both factoring schemes. 
For example, the 2F scheme tends to have an appetite for in-core memory 
stemming from the simultaneous storage of at least three flux jacobians, 
whereas the 6F scheme sequentially overwrites the same memory cells al- 
located for a single flux jacobian. Another example is the number of 
linear systems solved per cycle; the 6F scheme requires six compared to 
only two for the 2F scheme. Based on all the aspects of both schemes, 
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the 2F scheme seems to stand out as a superior factorization method. As 
a consequence of this analysis, the 2F scheme was implemented in all 
following results unless indicated otherwise. 

Although the 2F scheme is superior to the 6F scheme, there is still 
the ever present AF error ( Eq .(12)) introduced by this scheme. 
Presently, there is much interest regarding the elimination of this AF 
error. One method to eliminate the AF error of the 2F scheme is to alter 
the second step as follows 

{I + At [I + AtUjA* + 6 jB* + 5 k C- ) D' 1 [6 ± A~ + 6jB^ + <5 k C~]}X 2 = X 1 (13) 

Total elimination of the AF error is achieved by the premultiplication 
of the At term in the second step by the inverse of the first step op* 
erator . 

The difficulty with this theory is that it involves the inversion 
of a matrix of difference operators. An approximate version of this in- 
verse method can be formulated by substituting the following matrix for 
the proposed inverse of the first step 

[I + Ax(6 i A + + 6 jB + + 5 k C + );f 1 (14) 

Notice the operator dot (•) has been removed leading to a method which 
could readily be investigated. Although Eq.(13) should completely 
eliminate the AF error, the approximate version (Eq.(l4)) falls short of 
this goal. Evidently Eq.(l4) retains enough AF error to adversely affect 
the solution process. This method, when condensed to a single line equa- 
tion, closely mimics a relaxation scheme, which was avoided from the 
beginning. 


9 



Approximation and Acceleration 


The previous section was concerned with reducing or eliminating the 
error introduced as a consequence of factoring Eq.(1). This section ap- 
proaches the problem from what could be considered an opposite view- 
point, i.e., what can one get away with and still obtain an acceptable so- 
lution. The primary objective here is to shorten and/or simplify the 
path to the desired solution. 

First, consider a method which substitutes the true flux jacobians 
(TFJ) given by Eq.(3) with their corresponding spectral radii. A matrix 
of the following form is used in lieu of the TFJ matrices 

X SR,K 1 (15) 

where 

Aoo ^ * max ( | A^|) 06) 

Oil } 11 n • 

with 

A^ = eigenvalues of K, i = 1,2, 3, 4, 5 
and K is either A + , A - , B + , B", C + or C”. In Eq. (15), I has the usual 
meaning of the identity matrix. 

The inclusion of the matrices given by Eq.(15) yields the following 


[I 

+ 

AtCSjA I- 

+ SA I- + 5 k A 10] 

X 1 - 

-AxR n 

(18a) 



SR, A 

J SR,B SR,C 




[I 

+ 

At(<5.A _I* 

+ 6,A I- + 6 k A „lO] 

X 2 - 

X 1 

(18b) 



sr,a“ 

J SR,B~ SR,c"* 








AQ n « 

X 2 

(18c) 


10 


Equations (18) represent an uncoupled system thus eliminating the need 
for the simultaneous solution at a point. The solution process is sim- 
plified to that of a backward and a forward substitution. Thus far eve- 
rything sounds great, the drawback happens to be multi-faceted. 

The first difficulty is the isolation of the dominant eigenvalue. 
This can be accomplished using an iterative scheme referred to as the 
power method.-* Although it is a simple iterative routine, it tends to be 
costly. Secondly, the stability of the spectral radius method is much 
more restrictive than that of the 2F scheme, as expected. This degrada- 
tion in stability imposes a stringent time-step limitation. Together, 
the two drawbacks cited are enough to exclude the spectral radius method 
as an integral part of the solution algorithm. 

Another approximation technique involves the infrequent updating of 
the flux jacobians (flux jacobian freezing). The method is quite simple 
assuming a large quantity of in-core memory is available (or rapid 
transfer out-of-core storage devices are accessible). The basic approach 
is to calculate, store, and periodically update the flux jacobian matri- 
ces. The period of acceptable flux jacobian "freezing" varies from flow 
to flow. In many instances (steady state solutions), the flux jacobians 
need only be calculated once. 

This approach of flux jacobian freezing cuts per cycle computation 
time nearly in half depending, of course, on the frequency of updating. 
Although run time is significantly reduced, the storage requirement is 
doubled. The availability of abundant Cray SSD storage encouraged the 
inclusion of this approach as an integral part of the solution algo- 
rithm. 
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The final topic in this section, the use of multiple grids, is both 

an approximation and an acceleration technique. The multi-grid 

technique used here is that of a full approximation scheme (FAS). The 

theory is that the low frequency (LF) errors which are not damped well 

by the solution algorithm on a fine grid appear as high frequency (HF) 

errors on coarser grids and subsequently can be damped out on these 

grids. This is assuming, of course, that the solution algorithm damps HF 

errors well. This scheme can be applied to progressively coarser grids 

to damp most of the components of the error. For an in-depth study of 

the FAS method as it is applied to a similar finite volume code, the 

£ 

reader is referred to the work of Anderson. 

The basic approach involves solving an error equation on increas- 
ingly coarse meshes. This error equation and background material are 
given in Ref. 6 thus they will not be repeated here. The coarsening can 
be accomplished in a variety of ways. The simplest procedure involves 
starting with a fine grid then eliminating every other grid point in 
each computational direction to construct a coarser grid. The use of 
multi’-grid requires increasing the total storage by no more than 25%. 
Also, the computations on the coarser grids are relatively insignificant 
because each coarse grid contains only 1/8 as many points as it’s parent 
grid. 

Reference 6 gives a detailed outline of the steps involved in the 
FAS multi-grid V cycle which is used in this study. Various investiga- 
tive studies were performed to access the benefits derived from the 
multi-grid procedure. An ONERA M6 wing (49x9x9) was used as a test case 
to examine the multi-grid characteristics; such as, acceleration of con- 
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vergence, additional memory penalties, runtime per cycle per grid point, 
etc. In Fig. 2, the effect of the multi->grid procedure on reducing the 
number of cycles to reach a given level of convergence is shown. The 
asymptotic spectral radius (convergence rate) for the single grid was - 
.95, whereas that using a 2-level multi-grid V cycle with 3 iterations 
on the coarse grid was - .87. From the data collected, a computational 
savings of 32^ was realized using the multi-grid procedure. 

Thus far, it is evident that multi-grid is beneficial for steady- 
state solutions. It also has applications to unsteady solutions. 
Jesperson has successfully demonstrated the use of a time-accurate 
multi-grid algorithm.^ Recently, significant savings have been realized 
using a time-accurate multi-grid approach for a harmonically plunging 

Q 

airfoil. It is expected that multi-grid will become a useful aspect of 
rotor-^stator flow analysis, consequently it has been included in the 
solution algorithm. 
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Coding Developments 


The following section deals with computer algorithm modifications 
which do not alter the solution, yet represent significant advances. 
Some of these were developed by colleagues and integrated into the flow 
code. Several less significant code modifications have also been imple- 
mented over the past year. With the exception of the freezing of the 
flux jacobians, code developments have resulted in the largest reduction 
in code length and execution time. 

The first of the significant developments is referred to here as 
ribbon vector dynamic allocation, a concept developed by Belk. 3 The 
principle of ribbon vector storage is frequently used to compress stor- 
age to avoid wasting memory space (achieve a minimum core requirement). 
It functions well with modular codes because the bulk memory requirement 
can be allotted in the main calling routine, and floating dimensions can 
be used in the subroutines. Belk has demonstrated the use of this prin- 
ciple, coupled with the flexibility of the Cray to allow user defined 
dynamic memory management, to enable a dynamic mesh size. The plan is to 
use this in connection with a multi-block procedure to minimize in-core 
requirements. A version similar to trial proposed by Belk has been inte^ 
grated into this solution algorithm. 

This leads to another concept exhaustively investigated by Belk. 
It involves the segmenting of the computational domain into relatively 
compact computational blocks. The impetus for such an approach emanates 
from the difficulties encountered when generating a single grid about a 
complex geometrical shape such as a wing-pylon-store or the blades in 
rotating machinery. An additional attractive feature of multi-block is 
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the ability to sequentially solve several smaller flowfields comprising 
one larger, nearly insurmountable flowfield. As mentioned before, this 
section involves coding mods which do not alter the solution. 
Frequently, some slight degradation of a solution is acceptable when 
dealing with multiple block solutions. 

Belk has shown that some liberties can be taken regarding the in- 
ter-block transfer of data while still maintaining acceptable solution 
development. Based on these findings, a multi-block procedure is 
expected to function well in the analysis of a rotor-stator configurati- 
on. Related work has indicated that multi-block dynamic grid procedures 
are not difficult to implement. The detailed analysis of this procedure 
is on-going relative to its quantitative results and the influence of 
inter-block communication in rotating machinery analysis. 

The final contribution to code development was an attempt to opti- 
mize the solution procedure of the 2F scheme. The 2F scheme involves a 
point simultaneous solution and backward (or forward) substitutions. The 
substitution portion poses quite a problem when attempting to utilize 
vector hardware to it's fullest. The difficulty, referred to as a vector 
dependency^ (or a recursion problem), occurs when all the elements of a 
computational vector are not independent. For example, the equation 

x i = X i _ 1 i = 1 (19) 

involves a vector dependency when coded, because the i tn element depends 
on the (i - 1) th element. A method has been developed in order to 
circumvent this problem in a multidimensional space. The concept of 
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this method was originally conceived by Belk 10 and has no previous pub- 
lication. 

To begin, consider the example of a two-dimensional 3 pace in Fig. 3 
and the following equation 


X i,j X i-1 , j + X i , j-1 i“1»***» N i» 


( 20 ) 


If one attempts to logically construct computational vectors using the 
elements lying on any computational line, as shown, a vector dependency 
will be present. Thus the equation can only be solved in a much slower 
scalar mode. Now consider Fig. 4 and Eq. (20). If computations are car- 
ried out along the computational vectors (diagonal lines) as shown, the 
equation can be solved in a vector mode. This is because all the 
elements in a single computational vector are independent of one 
another. The elements of each computational vector are given by the line 

i + j = C (21) 

where C is the vector number, ranging from 0 to (N* + N i ) . In addition, 
the vector length varies from 1 to min{(N i +1 ) ,(Nj+1 ) }. This is a very simple 
procedure in two dimensions, now for the easy extension to three-dimen- 
sions. 

Consider the three-dimensional analogy to Eq. (20) 


X i,j,k X i-1,j,k + X i , j-1 ,k + X i,j,k-1 


i = 1,...,N i 
j = 1 , . . . , Nj (22) 

k — 1,..., 


Referring to Fig. 5, a similar procedure can be followed in order to 
vectorize a three-dimensional problem. The points lying on a diagonal 
plane constitute the computational vector with independent elements. 
These points are given by the equation of the plane 
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i + j + k = C 


(23) 


where C is the vector number, ranging from 0 to (N^ + Nj + N^). In addi- 
tion, the vector length varies from 1 to min{ (N^+1 ) (Nj + 1 ) , (Nj + 1 ) (N k +1 ) , 
(Nj+1 ) (N k +1 ) } . This procedure for vectorizing a backward (or forward) 
substitution has become an integral part of the solution algorithm. A 
timing analysis is difficult to obtain due to the maximum vector length - 
depending on the mesh structure. It is anticipated that the significance 
of the diagonal plane method will increase as the mesh sizes increase. 
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Results 


The year's progress can be accessed by comparing the status of the 
code of September 1985 to that of September 1986. The comparison can be 
made using the ONERA M6 wing (49x9x9) case presented in Table 1 of Ref. 
2. A comparable flow trace was generated using the modified version of 
BMULE, refer to Table 1b. Although a direct comparison cannot be made 
regarding individual subroutines, a direct comparison can be made 
between subroutine groups serving the same function. Consider, for exam- 
ple, the following list of modified BMULE subroutines and the cor- 
responding BMULE counterparts. 


September 1986 


September 1 985 


Performance Ratio 


1986 


METRIC 

BC 

RESID + -(3x9x9) x FLUX 
TSTEP 

6 X CMAT 
2 x D00 


METRIC 

.86 

BC 

2.09 

FLUX 

.43 

EIGENV 

1 .14 

2 x AE 

2 x BE 
2 x CE 

DOOIF 

DOOIB 

DOOJF 

1 .00 

DOOJB 

DOOKF 

DOOKB 

.25 
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The performance ratios should be examined in light of the percent 
of total runtime 3pent in each subroutine, given in Tables 1, This is 
accounted for when one examines the overall code performance ratio, 
found to be .21. Thus the modified version operates nearly 5 times as 
efficient as the previous version. This is mainly attributed to flux 
jacobian freezing and the diagonal plane solution path. The code now op- 
erates at 2.1 x 10~ 5 seconds per grid point per cycle. 

The code was used to analyze the flowfield of a typical cascade 
geometry.^ 1 Fig. 6 gives a view of the H-mesh used in this test case. 
Figs. 7 and 8 are Mach and pressure contour plots, respectively, of the 
flowfield resolved by the flow code. The qualitative agreement repre- 
sented in these figures is good. No quantitative solutions are presented 
here because the final solution of the modified BMULE code is the same 
as that of the original code, whose validity was established in Ref. 2. 
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Conclusions 


Several modifications were made to an operational implicit upwind 
finite volume computer algorithm. These include: ribbon vector storage, 
two-factor scheme, multigrid, diagonal plane solution path, etc. The 
goal of improving computational efficiency has been modestly achieved, 
although more improvement is necessary. The present computer code oper- 
ates 5 times more efficiently than it did previously. Advanced CFD pro- 
cedures have been integrated to produce a robust, accurate and efficient 
algorithm with the potential to provide advanced three-dimensional flow 
analysis of rotating machinery. 

Future emphasis should continue to be focused toward improving com- 
putational efficiency. At the present rate of improvement, the analysis 
of rotating machinery flowfields is well within view. Once the effi- 
ciency is achieved, then and only then, should detailed flow studies be 
expected. 

The modified version of the BMULE code presently resides on the 
NASA Marshall CRAY X-MP. 
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Table la. 


Flow Trace September 1985 


2 



ROUTINE 

TIME 

% 

CALLED 

AVERAGE T 

1 

BMULEWR 

0.306154 

0.70 

1 

0.306154 

2 

IC 

0.001208 

0.00 

1 

0.001208 

3 

METRIC 

0.010898 

0.02 

1 

0.010898 

4 

BC 

0.275058 

0.63 

101 

0.002723 

5 

STEP 

0.351348 

0.80 

100 

0.003513 

6 

FLUX 

6.024503 

13.72 

100 

0.060245 

7 

EIGENV 

0.337293 

0.77 

100 

0.003373 

8 

AE 

5.961722 

13.58 

200 

0.029809 

9 

DOOIF 

4.540607 

10.34 

100 

0.045406 

10 

DOOIB 

4.611240 

10.50 

100 

0.046112 

11 

BE 

6.575230 

14.97 

200 

0.032876 

12 

DOOJF 

2.043788 

4. 65 

100 

0.020438 

13 

DOOJB 

2. 129168 

4.85 

100 

0.021292 

14 

CE 

6.588037 

15.00 

200 

0.032940 

15 

DOOKF 

2.021897 

4.60 

100 

0.020219 

16 

DOOKB 

2. 129334 

4.85 

100 

0.021293 

17 

PVAR 

0.007076 

0.02 

1 

0.007076 

*** 

TOTAL 

43.914561 




•kic-k 

OVERHEAD 

0.051794 





Table 

lb. Flow Trace September 1986 


ROUTINE 

TIME 

% 

CALLED 

AVERAGE T 

1 BULLYII 

0.00021 3 

0.00 

1 

0.000213 

2 POINTER 

0.000077 

0.00 

1 

0.000077 

3 SETMEM 

0.007671 

0.08 

1 

0.007671 

4 INDEX 

0.000360 

0.00 

102 

0.000004 

5 IC 

0.000888 

0.01 

1 

0.000888 

6 GRID 

0.017251 

0.19 

1 

0.017251 

7 METRIC 

0.009412 

0.10 

1 

0.009412 

Q DP 

KJ UU 

i . 1 46532 

12.51 

201 
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Fig. 2 Residual 
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Fig. 6 Typical Cascade H-Mesh 
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Fig. 8 Pressure Contours 



